AITopics | song lyric

Collaborating Authors

song lyric

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

German court rules against OpenAI in copyright case

The Japan TimesNov-12-2025, 02:52:00 GMT

The Munich court found that OpenAI, the maker of ChatGPT, was not entitled to use song lyrics to train its artificial intelligence without licenses, and that the artists who wrote them are entitled to compensation. The Munich court found that the maker of ChatGPT was not entitled to use song lyrics to train its artificial intelligence without licenses, and that the artists who wrote them are entitled to compensation. In a time of both misinformation and too much information, quality journalism is more crucial than ever. By subscribing, you can help us get the story right. With your current subscription plan you can comment on stories.

large language model, machine learning, natural language, (16 more...)

The Japan Times

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.48)
Asia > Japan > Honshū (0.21)

Industry:

Leisure & Entertainment (1.00)
Law (1.00)
Media > News (0.71)
Government > Regional Government > Europe Government > Germany Government (0.43)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.64)

Add feedback

ChatGPT violated copyright law by 'learning' from song lyrics, German court rules

The GuardianNov-11-2025, 17:00:39 GMT

Songs used by ChatGPT included Herbert Grönemeyer's 1984 synth-pop sendup of masculinity, ' (Men). Songs used by ChatGPT included Herbert Grönemeyer's 1984 synth-pop sendup of masculinity, ' (Men). OpenAI ordered to pay undisclosed damages for training its language models on artists' work without permission The Munich regional court sided in favour of Germany's music rights society GEMA, which said ChatGPT had harvested protected lyrics by popular artists to "learn" from them. The collecting society GEMA, which manages the rights of composers, lyricists and music publishers and has approximately 100,000 members, filed the case against OpenAI in November 2024. The lawsuit was seen as a key European test case in a campaign to stop AI scraping of creative output.

large language model, machine learning, natural language, (18 more...)

The Guardian

Country: Europe > Germany > Bavaria > Upper Bavaria > Munich (0.27)

Industry:

Leisure & Entertainment > Sports (0.99)
Law > Litigation (0.71)
Government > Regional Government > Europe Government > Germany Government (0.41)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.55)

Add feedback

Language models for longitudinal analysis of abusive content in Billboard Music Charts

Chandra, Rohitash, Suresh, Yathin, Sinha, Divyansh Raj, Jindal, Sanchit

arXiv.org Artificial IntelligenceOct-9-2025

There is no doubt that there has been a drastic increase in abusive and sexually explicit content in music, particularly in Billboard Music Charts. However, there is a lack of studies that validate the trend for effective policy development, as such content has harmful behavioural changes in children and youths. In this study, we utilise deep learning methods to analyse songs (lyrics) from Billboard Charts of the United States in the last seven decades. We provide a longitudinal study using deep learning and language models and review the evolution of content using sentiment analysis and abuse detection, including sexually explicit content. Our results show a significant rise in explicit content in popular music from 1990 onwards. Furthermore, we find an increasing prevalence of songs with lyrics containing profane, sexually explicit, and otherwise inappropriate language. The longitudinal analysis of the ability of language models to capture nuanced patterns in lyrical content, reflecting shifts in societal norms and language use over time.

large language model, lyric, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2510.06266

Country:

North America > United States (0.66)
Europe (0.46)

Genre: Research Report > New Finding (1.00)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

From Joy to Fear: A Benchmark of Emotion Estimation in Pop Song Lyrics

Dahary, Shay, Edana, Avi, Apartsin, Alexander, Aperstein, Yehudit

arXiv.org Artificial IntelligenceSep-9-2025

The emotional content of song lyrics plays a pivotal role in shaping listener experiences and influencing musical preferences. This paper investigates the task of multi-label emotional attribution of song lyrics by predicting six emotional intensity scores corresponding to six fundamental emotions. A manually labeled dataset is constructed using a mean opinion score (MOS) approach, which aggregates annotations from multiple human raters to ensure reliable ground-truth labels. Leveraging this dataset, we conduct a comprehensive evaluation of several publicly available large language models (LLMs) under zero-shot scenarios. Additionally, we fine-tune a BERT-based model specifically for predicting multi-label emotion scores. Experimental results reveal the relative strengths and limitations of zero-shot and fine-tuned models in capturing the nuanced emotional content of lyrics. Our findings highlight the potential of LLMs for emotion recognition in creative texts, providing insights into model selection strategies for emotion-based music information retrieval applications. The labeled dataset is available at https://github.com/LLM-HITCS25S/LyricsEmotionAttribution.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2509.05617

Country: Asia > Middle East > Israel (0.14)

Genre: Research Report > New Finding (1.00)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

ArzEn-MultiGenre: An aligned parallel dataset of Egyptian Arabic song lyrics, novels, and subtitles, with English translations

Al-Sabbagh, Rania

arXiv.org Artificial IntelligenceAug-5-2025

This is an open access article under the CC BY license ( http://creativecommons.org/licenses/by/4.0/) 2 R. Al-Sabbagh / Data in Brief 54 (2024) 1 10271 Subject Computer Science, Social Sciences Specific subject area Natural Language Processing, machine translation, large-language models, translation studies, cross-linguistic analysis, lexical semantics Data format Translated and aligned Type of data Texts (Bilingual tables in Microsoft Excel files) Data collection The ArzEn-MultiGenre dataset consists of three genres: song lyrics, novels, and subtitles. The data was gathered from various sources using different methods. A website was crawled for song lyrics using an in-house web crawler, and professional translators manually translated the lyrics into English. For novels, hard copies were collected in English and Egyptian Arabic, then scanned and converted into text files using an Optical Character Recognizer (OCR). The OCR output was then manually reviewed and aligned.

artificial intelligence, machine translation, natural language, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1016/j.dib.2024.110271

2508.01411

Country: Asia > Middle East > UAE (0.14)

Genre: Research Report (0.64)

Industry:

Leisure & Entertainment (1.00)
Media > Music (0.93)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Beyond English: Unveiling Multilingual Bias in LLM Copyright Compliance

Chen, Yupeng, Zhang, Xiaoyu, Huang, Yixian, Xie, Qian

arXiv.org Artificial IntelligenceFeb-14-2025

Large Language Models (LLMs) have raised significant concerns regarding the fair use of copyright-protected content. While prior studies have examined the extent to which LLMs reproduce copyrighted materials, they have predominantly focused on English, neglecting multilingual dimensions of copyright protection. In this work, we investigate multilingual biases in LLM copyright protection by addressing two key questions: (1) Do LLMs exhibit bias in protecting copyrighted works across languages? (2) Is it easier to elicit copyrighted content using prompts in specific languages? To explore these questions, we construct a dataset of popular song lyrics in English, French, Chinese, and Korean and systematically probe seven LLMs using prompts in these languages. Our findings reveal significant imbalances in LLMs' handling of copyrighted content, both in terms of the language of the copyrighted material and the language of the prompt. These results highlight the need for further research and development of more robust, language-agnostic copyright protection mechanisms to ensure fair and consistent protection across languages.

ground truth, language model, lyric, (15 more...)

arXiv.org Artificial Intelligence

2503.05713

Country:

North America > United States > California (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > Switzerland > Geneva > Geneva (0.04)
(3 more...)

Genre: Research Report > New Finding (0.88)

Industry: Law > Intellectual Property & Technology Law (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Anthropic agrees to work with music publishers to prevent copyright infringement

EngadgetJan-3-2025, 15:47:43 GMT

Anthropic has partly resolved a legal disagreement that saw the AI startup draw the ire of the music industry. The group alleged that the company had trained its Claude AI model on at least 500 songs to which they held rights and that, when promoted, Claude could reproduce the lyrics of those tracks either partially or in full. Among the song lyrics the publishers said Anthropic had infringed on included Beyoncé's "Halo" and "Moves Like Jagger" by Maroon 5. In cases where the company intends not to address an issue, it must clearly state its intent to do so. "Our decision to enter into this stipulation is consistent with those priorities.

anthropic, infringement, music publisher, (6 more...)

Engadget

Industry:

Law > Intellectual Property & Technology Law (1.00)
Media > Music (1.00)

Technology: Information Technology > Artificial Intelligence (0.66)

Add feedback

Beats of Bias: Analyzing Lyrics with Topic Modeling and Gender Bias Measurements

Chen, Danqing, Satish, Adithi, Khanbayov, Rasul, Schuster, Carolin M., Groh, Georg

arXiv.org Artificial IntelligenceSep-24-2024

This paper uses topic modeling and bias measurement techniques to analyze and determine gender bias in English song lyrics. We utilize BERTopic to cluster 537,553 English songs into distinct topics and chart their development over time. Our analysis shows the thematic shift in song lyrics over the years, from themes of romance to the increasing sexualization of women in songs. We observe large amounts of profanity and misogynistic lyrics on various topics, especially in the overall biggest cluster. Furthermore, to analyze gender bias across topics and genres, we employ the Single Category Word Embedding Association Test (SC-WEAT) to compute bias scores for the word embeddings trained on the most popular topics as well as for each genre. We find that words related to intelligence and strength tend to show a male bias across genres, as opposed to appearance and weakness words, which are more female-biased; however, a closer look also reveals differences in biases across topics.

gender bia, lyric, sc-weat score, (13 more...)

arXiv.org Artificial Intelligence

2409.15949

Country:

South America > Uruguay > Maldonado > Maldonado (0.04)
North America > United States > Michigan (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)

Genre: Research Report (0.84)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language (1.00)

Add feedback

SONICS: Synthetic Or Not -- Identifying Counterfeit Songs

Rahman, Md Awsafur, Hakim, Zaber Ibn Abdul, Sarker, Najibul Haque, Paul, Bishmoy, Fattah, Shaikh Anowarul

arXiv.org Artificial IntelligenceAug-27-2024

The recent surge in AI-generated songs presents exciting possibilities and challenges. While these tools democratize music creation, they also necessitate the ability to distinguish between human-composed and AI-generated songs for safeguarding artistic integrity and content curation. Existing research and datasets in fake song detection only focus on singing voice deepfake detection (SVDD), where the vocals are AI-generated but the instrumental music is sourced from real songs. However, this approach is inadequate for contemporary end-to-end AI-generated songs where all components (vocals, lyrics, music, and style) could be AI-generated. Additionally, existing datasets lack lyrics-music diversity, long-duration songs, and open fake songs. To address these gaps, we introduce SONICS, a novel dataset for end-to-end Synthetic Song Detection (SSD), comprising over 97k songs with over 49k synthetic songs from popular platforms like Suno and Udio. Furthermore, we highlight the importance of modeling long-range temporal dependencies in songs for effective authenticity detection, an aspect overlooked in existing methods. To capture these patterns, we propose a novel model, SpecTTTra, that is up to 3 times faster and 6 times more memory efficient compared to popular CNN and Transformer-based models while maintaining competitive performance. Finally, we offer both AI-based and Human evaluation benchmarks, addressing another deficiency in current research.

dataset, lyric, movie, (17 more...)

arXiv.org Artificial Intelligence

2408.1408

Country:

North America > United States > Indiana (0.04)
Europe > United Kingdom > England > Greater London > London > Wimbledon (0.04)
Asia > Middle East > Republic of Türkiye > Batman Province > Batman (0.04)
(18 more...)

Genre: Research Report (1.00)

Industry:

Media > Music (1.00)
Media > Film (1.00)
Leisure & Entertainment > Sports (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Towards Estimating Personal Values in Song Lyrics

Demetriou, Andrew M., Kim, Jaehun, Manolios, Sandy, Liem, Cynthia C. S.

arXiv.org Artificial IntelligenceAug-22-2024

Most music widely consumed in Western Countries contains song lyrics, with U.S. samples reporting almost all of their song libraries contain lyrics. In parallel, social science theory suggests that personal values - the abstract goals that guide our decisions and behaviors - play an important role in communication: we share what is important to us to coordinate efforts, solve problems and meet challenges. Thus, the values communicated in song lyrics may be similar or different to those of the listener, and by extension affect the listener's reaction to the song. This suggests that working towards automated estimation of values in lyrics may assist in downstream MIR tasks, in particular, personalization. However, as highly subjective text, song lyrics present a challenge in terms of sampling songs to be annotated, annotation methods, and in choosing a method for aggregation. In this project, we take a perspectivist approach, guided by social science theory, to gathering annotations, estimating their quality, and aggregating them. We then compare aggregated ratings to estimates based on pre-trained sentence/word embedding models by employing a validated value dictionary. We discuss conceptually 'fuzzy' solutions to sampling and annotation challenges, promising initial results in annotation quality and in automated estimations, and future directions.

correlation, lyric, music, (15 more...)

arXiv.org Artificial Intelligence

2408.12694

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > United Kingdom (0.14)
Europe > Austria > Vienna (0.14)
(8 more...)

Genre: Research Report (0.83)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Communications (0.68)

Add feedback